[Day 16] Metrics / Evaluation Metrics - Classification

11th iThome Ironman Contest

DAY 16

AI & Data

Part 16 of the series "Learning from Top Kagglers How to Win Data Analysis Competitions"

madeleine

2019-09-17 22:25:54

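Terminology

soft label - useful knowledge distilled from a well-trained model (scores or probabilities rather than a single class)
hard label - the conventional 1-or-0 representation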

Accuracy

Best Constant : predict the most frequent class

Example

Dataset : 10 cakes, 90 eggs
Predict always eggs : accuracy = 0.9!
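
The cake-and-egg example above can be checked with a few lines of code. This is only a minimal sketch: the `accuracy` helper and the string labels are illustrative names, not something taken from the course material.

```python
from collections import Counter

# Toy dataset from the example: 10 cakes and 90 eggs.
y_true = ["cake"] * 10 + ["egg"] * 90


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Best constant for accuracy: always predict the most frequent class.
most_frequent = Counter(y_true).most_common(1)[0][0]
y_const = [most_frequent] * len(y_true)
baseline = accuracy(y_true, y_const)

print(most_frequent, baseline)  # egg 0.9
```

A model that has learned nothing still reaches 0.9 here, which is why a raw accuracy number is hard to interpret on an imbalanced dataset.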

Logarithmic loss (logloss)

Works on soft predictions (predicted probabilities).

(Screenshot from Coursera)

Best Constant : set αi to frequency of i-th class

Example

Dataset : 10 cakes, 90 eggs
α = [0.1, 0.9]
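
To see why the class frequencies are the best constant, the sketch below evaluates the logloss of several constant predictions on the same dataset. It assumes the standard multiclass definition, logloss = -(1/N) Σ log(predicted probability of the true class); the function names and the clipping value `eps` are illustrative choices.

```python
import math


def logloss(y_true, probs, eps=1e-15):
    """Mean multiclass log loss.

    probs[i][k] is the predicted probability of class k for sample i.
    Probabilities are clipped to [eps, 1 - eps] to avoid log(0).
    """
    total = 0.0
    for label, p in zip(y_true, probs):
        p_label = min(max(p[label], eps), 1 - eps)
        total -= math.log(p_label)
    return total / len(y_true)


# Toy dataset from the example: class 0 = cake (10), class 1 = egg (90).
y_true = [0] * 10 + [1] * 90


def constant_loss(alpha):
    """Log loss when every sample receives the same probability vector alpha."""
    return logloss(y_true, [alpha] * len(y_true))


freq = constant_loss([0.1, 0.9])             # class frequencies
uniform = constant_loss([0.5, 0.5])          # no information at all
overconfident = constant_loss([0.01, 0.99])  # too sure about eggs

# Grid search over the probability assigned to "cake" (0.01 .. 0.99).
best_step = min(range(1, 100), key=lambda a: constant_loss([a / 100, 1 - a / 100]))

print(round(freq, 4), round(uniform, 4), round(overconfident, 4))
print("best constant probability for cake:", best_step / 100)
```

The frequency vector gives the lowest loss of the three, and the grid search lands on the same value, 0.1 for cakes.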

Area Under Curve (AUC ROC)

pair = (red object, green object)

TP : true positive, FP : false positive
(Screenshot from Coursera)

Best constants - All constants give same score
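
The pair-based view of AUC can be written down directly: it is the fraction of (positive, negative) pairs in which the positive object receives the higher score. The sketch below is a plain O(n²) illustration, not an efficient implementation; counting a tied pair as half correct is an assumed (though common) convention, and the sample scores are made up.

```python
def auc_pairwise(y_true, scores):
    """AUC as the fraction of correctly ordered (positive, negative) pairs.

    A tied pair counts as half correct (assumed convention).
    """
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    ordered = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                ordered += 1.0
            elif p == n:
                ordered += 0.5
    return ordered / (len(pos) * len(neg))


# Hypothetical labels and scores, for illustration only.
y_true = [0, 0, 1, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8, 0.65, 0.9]

auc_model = auc_pairwise(y_true, scores)        # 6 of 9 pairs are ordered correctly
auc_const_a = auc_pairwise(y_true, [0.3] * 6)   # every pair is tied
auc_const_b = auc_pairwise(y_true, [0.9] * 6)   # every pair is tied

print(round(auc_model, 4), auc_const_a, auc_const_b)
```

Because a constant prediction ties every pair, the score does not depend on which constant is chosen; under the half-credit convention it is always 0.5.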

Cohen's Kappa motivation

Works on hard predictions (predicted class labels).

(Screenshot from Coursera)

Example

Dataset : 10 cakes, 90 eggs
Baseline accuracy=0.9

predict 20 cakes and 80 eggs at random : accuracy ~0.74
0.2 * 0.1 + 0.8 * 0.9 = 0.74
error ~ 0.26
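
The 0.74 figure is the accuracy expected by chance, and Cohen's Kappa rescales accuracy against it. The sketch below assumes the usual definition, kappa = 1 - (1 - accuracy) / (1 - p_e), where p_e is the chance accuracy; the formula is not spelled out in the surviving text, and the function names are illustrative.

```python
def chance_accuracy(true_freq, pred_freq):
    """Expected accuracy when predictions are shuffled at random.

    Both arguments are per-class frequencies that sum to 1.
    """
    return sum(t * p for t, p in zip(true_freq, pred_freq))


def cohens_kappa(accuracy, p_e):
    """Cohen's Kappa: 1 - (observed error) / (chance-level error)."""
    return 1.0 - (1.0 - accuracy) / (1.0 - p_e)


true_freq = [0.1, 0.9]   # 10 cakes, 90 eggs
pred_freq = [0.2, 0.8]   # the model predicts 20 cakes, 80 eggs

p_e = chance_accuracy(true_freq, pred_freq)   # about 0.74
kappa_random = cohens_kappa(p_e, p_e)         # random guessing scores 0
kappa_model = cohens_kappa(0.9, p_e)          # hypothetical model with accuracy 0.9

# Always predicting eggs: chance accuracy and real accuracy are both 0.9.
p_e_const = chance_accuracy(true_freq, [0.0, 1.0])
kappa_const = cohens_kappa(0.9, p_e_const)

print(round(p_e, 2), round(kappa_random, 2), round(kappa_model, 3), round(kappa_const, 2))
```

Under these assumptions, the always-eggs baseline that looked strong at accuracy 0.9 scores a kappa of about 0, while a model reaching the same accuracy with a 20/80 prediction split scores about 0.615.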

(Screenshot from Coursera)
